منابع مشابه
Error correction for massive datasets
The paper is concerned with the problem of automatic detection and correction of errors into massive data sets. As customary, erroneous data records are detected by formulating a set of rules. Such rules are here encoded into linear inequalities. This allows to check the set of rules for inconsistencies and redundancies by using a polyhedral mathematics approach. Moreover, it allows to correct ...
متن کاملIssues in preprocessing current datasets for grammatical error correction
In this report, we describe some of the issues encountered when preprocessing two of the largest datasets for Grammatical Error Correction (GEC); namely the public FCE corpus and NUCLE (along with associated CoNLL test sets). In particular, we show that it is not straightforward to convert character level annotations to token level annotations and that sentence segmentation is more complex when...
متن کاملSpatial Prediction for Massive Datasets
Remotely sensed spatio-temporal datasets on the order of megabytes to terrabytes are becoming more common. For example, polar-orbiting satellites observe Earth from space, monitoring the Earth’s atmospheric, oceanic, and terrestrial processes, and generate massive amounts of environmental data. The current generation of satellites, such as the National Aeronautic and Space Administration’s (NAS...
متن کاملMassive Datasets in Astronomy
Astronomy has a long history of acquiring, systematizing, and interpreting large quantities of data. Starting from the earliest sky atlases through the first major photographic sky surveys of the 20th century, this tradition is continuing today, and at an ever increasing rate. Like many other fields, astronomy has become a very data-rich science, driven by the advances in telescope, detector, a...
متن کاملTyping Massive JSON Datasets
Cloud-specific languages are usually untyped, and no guarantees about the correctness of complex jobs can be statically obtained. Datasets too are usually untyped and no schema information is needed for their manipulation. In this paper we sketch a typing algorithm for JSON datasets. Our approach can be used to infer a succinct type from scratch for a collection of JSON objects, as well as to v...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Optimization Methods and Software
سال: 2005
ISSN: 1055-6788,1029-4937
DOI: 10.1080/10556780512331318281